Generalised state spaces and non-locality in fault tolerant quantum computing 

schemes. 
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We develop connections between generalised notions of entanglement and quantum computational 
devices where the measurements available are restricted, either because they are noisy and/or be- 
cause by design they are only along Pauli directions. By considering restricted measurements one 
can (by considering the dual positive operators) construct single particle state spaces that are dif- 
ferent to the usual quantum state space. This leads to a modified notion of entanglement that can 
be very different to the quantum version (for example, Bell states can become separable). We use 
this approach to develop alternative methods of classical simulation that have strong connections to 
the study of non-local correlations: we construct noisy quantum computers that admit operations 
outside the Clifford set and can generate some forms of multiparty quantum entanglement, but are 
otherwise classical in that they can be efficiently simulated classically and cannot generate non-local 
statistics. Although the approach provides new regimes of noisy quantum evolution that can be 
efficiently simulated classically, it does not appear to lead to significant reductions of existing upper 
bounds to fault tolerance thresholds for common noise models. 



A scalable quantum device is useful for non-classical 
computation if it cannot be simulated efficiently classi- 
cally, as any such device can always be used compute 
its own observable behaviour. It is hence important to 
understand when a quantum device can or cannot be ef- 
ficiently simulated classically. In one direction this prob- 
lem is hard: while it is conjectured to be the case that 
ideal quantum computers are not classically tractable, 
proving this rigorously would resolve longstanding open 
problems in theoretical computer science. 

The opposite direction is usually more amenable to 
attack. For example, it is possible to construct various 
families of quantum system that can be efficiently simu- 
lated classically |29|, including ones that contain signif- 
icant amounts of multiparty entanglement [15l l3fj ■ Of 
particular interest are the noise levels required before a 
proposed quantum computer becomes so noisy that it 
can be efficiently simulated classically [12|. These noise 
levels are important because they represent the noise lev- 
els at which the power to do non-classical computation is 
lost, and hence are strongly related to the topic of fault 
tolerant quantum computation. Studies in that area have 
shown that for a variety of noise models there is a so- 
called quantum fault tolerant threshold noise level, below 
which it is possible for a noisy quantum computer to sim- 
ulate an ideal quantum computer efficiently. However, in 
almost all cases the values of this threshold are unknown. 
Lower bounds to the thresholds have been obtained us- 
ing constructive methods for quantum error correction 
that are robust to imperfections (see e.g. [llU). Upper 
bounds have been obtained in two ways - one approach is 
to argue that once the noise is too high the output cannot 
contain sufficient information about the input [9|, |l3|, [l4 1 , 
another approach is to argue that above a certain noise 
level the system can be efficiently simulated classically 
[3J-4sL ll2L Il6| . It is this latter 'classical' approach that we 



will follow here. 

Within the classical approach there are two methods 
for simulating noisy quantum computers that will be rel- 
evant to us. The first is a method of Harrow & Nielsen 
(HN) |3[ , which argues that if the noise level is too high 
then the entangling gates in the device (for instance the 
CNOT gates) will become non-entangling, and that once 
this happens the probability distribution of measurement 
outcomes can be efficiently sampled classically. The sec- 
ond method uses the Gottesman-Knill (GK) theorem [15| 
to show that if the noise levels are high enough, all the el- 
ementary operations in the devices will enter the so-called 
stabilizer or Clifford set, and hence can be efficiently sim- 
ulated using the stabilizer formalism. The GK theorem 
is especially relevant for fault tolerant quantum compu- 
tation because the most well understood fault tolerance 
proposals are built using stabilizer operations with an 
additional non-stabilizer resource. 

The aim of the present work is to use a generalised 
notion of entanglement [23( (in a sense described in the 
next section) to apply a modified version of the Harrow 
& Nielsen method to devices where there are restrictions 
on the measurements available. Such restrictions arise 
naturally in fault tolerant quantum computation for two 
reasons. Firstly, most fault tolerance schemes only use 
measurements of certain observables (in particular Pauli 
measurements or their generalisations in higher dimen- 
sions). Secondly, any measurements that are available 
are always prone to noise. It turns out that such re- 
strictions can allow one to consider non-quantum single 
particle state spaces, and thereby a new notion of entan- 
glement. 

We will show that this approach forms a bridge be- 
tween the foundations of quantum theory and the study 
of noisy quantum computation. For instance, the ap- 
proach shows that for quantum computers using phys- 



ical measurements from the Pauli operators only, non- 
classical computation is only possible if the dynamics is 
either non-positive w.r.t. the non-quantum state space, 
or can generate non-local correlations. It also motivates 
the construction of generalised probabilistic theories [24[ 
that are inspired by fault tolerant quantum computation 
schemes (although in our primary case the generalised 
theory turns out to be simply classical), and provides 
regimes of classically tractable noisy quantum evolution 
that appear to fall outside the scope of existing meth- 
ods. It also falls under the programme initiated in [23[ 
to investigate so-called generalised notions of entangle- 
ment, and motivates some interesting questions regard- 
ing the contrast between different forms of generalised 
multiparty entanglement. 

Any readers not interested in technical details may ig- 
nore the appendices and skim the detailed calculations 
in the latter half of the paper. In the first few sections 
we begin with some general motivation and properties of 
the restriction to Pauli measurements. 



GENERALISED NOTIONS OF 

ENTANGLEMENT, AND APPLICATION TO 

THE STABILIZER FORMALISM. 



For our purposes a classical simulation of a quantum 
device is a classical algorithm that samples the proba- 
bility distribution of measurement outcomes that occur 
in the device (we do not require the ability to calculate 
these probabilities directly). For a general quantum sys- 
tem this sampling is believed to be impossible to achieve 
efficiently, due to the conjectured superiority of the quan- 
tum computer. However, if the quantum gates in the 
system are so noisy that they are incapable of entangling 
separable input states, then Harrow & Nielsen (HN) [3[ 
showed that it can be efficiently simulated classically. It 
is helpful to illustrate their method in order to motivate 
the rest of this note. Consider a noisy two-particle gate 
that has this property (i.e. it is incapable of entangling 
separable inputs) and suppose that it acts on the first 
two input qubits in the device. As the gate is by hypoth- 
esis separable, it will give an output state p that is also 
separable [171 ] : 



p = y^PiQj 0h, 



(1) 



where a^ , hi are states on the first and second qubits re- 
spectively, and "Pi is a probability distribution. In such 
cases the algorithm of [3[ does not store the whole state, 
a task which could require exponentially large memory, 
but instead classically samples the probability distribu- 
tion {pi}, and then only stores a,j, bj corresponding to the 
result j of this sampling. This process is then repeated 
for every gate in the quantum circuit. When the last 



gate in the circuit has been applied, the computer mem- 
ory will have in storage N single qubit states (where N 
is the number of qubits), requiring modest memory. To 
complete the simulation we sample what happens when 
single qubit measurements are performed on these states. 
It is intuitively clear that this process, if performed to 
sufficiently high accuracy, will lead to a final outcome 
that is sampled from the same distribution as would be 
obtained if the quantum device were actually built. A 
complete analysis of the algorithm requires an analysis 
of the errors induced by finite precision storage and sam- 
pling methods on classical computers. However, these 
details will not concern us, as the analysis presented in 
|3| applies with only minor modification to the alterna- 
tive notion of entanglement that we consider below. It is 
also useful to note that the HN method also applies to 
adaptive schemes, where measurement outcomes are use 
to determine future choices of gate. This property also 
carries over to the generalised notions of entanglement 
that we now consider. 

The ordinary notion of quantum separability relies on 
the conventional notion of a single particle quantum state 
space. In a manner similar to [23|, we will consider a 
redefinition of the single particle state space away from 
the 'quantum' version, thereby giving a different notion 
of 'separability' under which one can apply the algorithm 
of [3[ . As a starting point for our generalised state spaces 
we will use the following characterisation of the quantum 
set Qd of dimension d single particle states. We may 
define Qd as the set of complex d x d matrices that give 
valid probability distributions for all POVMs using the 
Born rule: 



Q d = { p | tv{Mp} > ,VI > M > 0} 



(2) 



This definition can be generalised: we may define a sin- 
gle particle 'state space' as the set of matrices that give 
valid probability distributions for the set of measure- 
ments that we have available. If the set of measure- 
ments is restricted, then the set of 'states' defined in 
this way will be larger than quantum - it will contain 
matrices that are not admissible as quantum states be- 
cause they needn't be positive for measurements outside 
the restricted set. If we are considering quantum devices 
with such restricted measurements, then we may apply 
the HN algorithm to efficiently simulate gates that are 
separable w.r.t. these modified state spaces - the modi- 
fied state spaces still allow us to sample the probability 
distribution obtained from the restricted measurements 
available. As the gates that are separable will be differ- 
ent to the quantum-separable ones, this leads to different 
regimes of classically efficient simulation. 

There are various natural ways to select a set of "al- 
lowed" measurements. One possibility is to consider a 
single POVM (which may or may not be noisy), and then 
construct a set of measurements by acting upon it with 



a symmetry group of single particle unitaries. A proto- 
typical example is inspired by stabilizer architectures for 
fault tolerant quantum computation: in these proposals 
the measurements are restricted to the Pauli axes, which 
can be generated from a single Pauli measurement using 
the 'Clifford group' [15j. It is this restriction that we 



consider in the next section. 



BLOCH CUBES 

We begin by analysing the restriction to Pauli X, Y, Z 
measurements. It is easy to show that in this case the set 
of valid states is a 'cube', the corners of which are given 
by eight Bloch vectors 

{(1, 1, 1), (1, 1, -1), (1, -1, 1), .., (-1, -1, -1)}. (3) 

To see this, note that because the Pauli matrices form a 
basis, any 2 dimensional matrix can be written as 



1 



{al + bX + cY + dZ) 



(4) 



for some expansion coefficient a, 6, c, d. Under the Born 
rule, probability of getting +1 on measuring Z is (a + 
d)/2, and the probability of getting the -1 outcome is 
(a — d)/2. Hence for Z measurements alone any matrix 
with a = 1 and d G [—1,-1-1] gives a valid probability 
distribution. Similar conclusions hold for X, Y measure- 
ments, and so we find that any matrix in the 'Bloch cube' 
with a = 1 and b, c, d € [—1,-1-1] is admissible (see fig. 
([1]), and also reference 2l| for some related aspects of 



this geometry for hidden variable models). 

Now that we have established these cubes of 'single 
particle states' for X, Y, Z measurements, let us consider 
the implications of using them to define a modified notion 
of entanglement. We define a separable two particle state 
as one that is a convex combination of products of valid 
"single particle states" . i.e. a separable two particle state 
is now one that can be written as 



p = y^ffiQj <S>6i 



(5) 



where a^ , bi are single qubit operators represented by vec- 
tors from the Bloch cube, not just positive states from 
the Bloch sphere. 

If we have a quantum device that can only perform 
Pauli X, Y, Z measurements, and if the gates in the device 
take separable Bloch cube input states to cube-separable 
output states, then we can apply the HN algorithm with- 
out modification to efficiently simulate the system clas- 
sically. This is particularly relevant for the 'magic state' 
architectures [lfj, [ll( . These devices have the following 
attributes: 

• The ability to measure in the Pauli bases. 




FIG. 1: The Bloch sphere represents the set of positive op- 
erators when all POVMs are included. If the Born rule is 
used with only Pauli measurements, however, then a cube of 
operators becomes positive. The eight corners of the cube 
are given by Bloch vectors (s,t, u) with s,t,u £ ±1, and at- 
tain any set of possible outcomes for Pauli measurements. 
Hence we describe Pauli measurements as 'operator compat- 
ible'. Any pure quantum state that is not a Pauli eigenstate 
lies on the surface of the sphere but strictly inside the cube. 



• The ability to prepare single qubit states that are 
not mixtures of Pauli eigenstatcs. 

• The ability to do single qubit unitaries from the 
Clifford group - these unitaries are all symmetries 
of the Bloch-cube. 

• The ability to do a single two qubit operation, typ- 
ically a CNOT or CSIGN gate, or something else 
sufficient for generating the entire Clifford group on 
n-qubits. In these notes we shall only consider the 
CSIGN gate, which is represented by the unitary 
matrix |00)(00| + |01)(01| + |10)(10|-|11)(11|. The 
results that we will obtain can be applied to other 
entangling Clifford group gates by modifying the 
local X, Y, Z reference frame appropriately. 

The first three of these attributes fit nicely into the pic- 
ture of 'cube states' - the Pauli directions are the al- 
lowed measurements, single qubit states from the Bloch 
sphere are contained entirely within the Bloch-cube, and 
the single qubit Clifford unitaries are symmetries of the 
Bloch-cube. 

The only operations that fail to preserve cube- 
separability are the last ones - the two qubit Clifford 
operations. That this is true is a special case of a result 
in [221 ] in the context of box-world generalised theories, 
but for convenience in Appendix II we provide an alter- 
native proof that is tailored to our problem. 

We hence must answer the following question: how 
much noise does it take to make a CSIGN gate cube- 
separability preserving! If we have such noise levels then 



wc know that we can simulate the system efficiently clas- 
sically, and hence these noise levels should supply upper 
bounds to the fault tolerance thresholds. 

Because the regime covered by cube-separability is dif- 
ferent from both the GK regime (we admit all single qubit 
states), and the the original HN approach (there are en- 
tangled states that are cube-separable) , there is a chance 
that it can simulate some systems efficiently classically 
that are not covered by naive applications of the other 
two approaches. We provide evidence for the existence 
of such regimes below, although it is possible that more 
sophisticated versions of these existing algorithms (per- 
haps allowing entangled 'clusters', as in |12| ) could cover 
any regime that we propose here. 

In considering non-quantum states, we are exploring 
a type of generalised probabilistic theory 24| that can 
lie outside conventional quantum mechanics. However, 
we are still using a large amount of quantum structure: 
while the cubes of states that we consider are effectively 
(3-measurement, 2-outcome) 'boxes' from box-world the- 



ories [241 ] , we take our dynamics directly from noisy quan- 
tum mechanics with a given Hilbert space. In some cases 
the generalised theory that we are trying to construct 
is nevertheless effectively classical — most of the cal- 
culations in the paper in fact involve identifying noisy 
quantum gates that can be considered valid operations 
in a classical theory. However, it is because we are con- 
sidering noisy quantum gates that we choose to retain 
the description of the cubes in terms of Pauli operators, 
rather than using tables of probabilities that arc more 
natural in generalised theories. 



CONTRAST BETWEEN 

CUBE-ENTANGLEMENT AND QUANTUM 

ENTANGLEMENT, CONNECTIONS TO 

NONLOCALITY 

The main technical problem underlying these investi- 
gations is to decide when a noisy quantum operation is 
cube-separable. This section presents a number of obser- 
vations that are useful for understanding the context of 
this problem, and its relationship to ordinary quantum 
entanglement and non-locality. 

Definition 1: Operator compatibility. We define a 
set of quantum measurements to be operator compat- 
ible if one can find, for all combinations of measurement 
outcomes, an operator (usually a non-positive operator) 
that achieves those outcomes dctcrministically under the 
Born rule. I 

For instance, X, Y, Z measurements are operator com- 
patible: suppose that you wish to get values e, /, g for 
measurement of X, Y, Z respectively, where e,f,g G ±1. 
Then the operator 1/2(7 + eX + fY + gZ) achieves this 
under the Born rule. These operators are just the cor- 
ners of the Bloch cube. The same argument extends to 



any set of measurements described by traceless orthog- 
onal observables (in Hilbcrt-Schmidt inner product), as 
one can write out the desired operator as a suitable linear 
sum of the observables. 

While this notion of 'compatibility' for sets of quantum 
measurements does not occur in general, it is not difficult 
to generically characterise the set of operator compatible 
measurements: 

Lemma 2: Characterisation of operator com- 
patible measurements. On quantum systems of di- 
mension d a set of N distinct POVMs can only be opera- 
tor compatible if the total number of outcomes summed 
over all the POVMs is < d 2 + N - 1. Generically cho- 
sen measurements satisfying this constraint are operator 
compatible. 

Proof: Consider N POVMs Pi, P 2 .., P N of a d dimen- 
sional quantum system, and suppose that these POVMs 
have m i, TO2.., m^v outcomes respectively. A trace-one 
hermitian matrix A of dimension d x d has d 2 — 1 free 
parameters. Under the Born rule only one of the POVM 
elements in each POVM must have probability one, the 
rest must have probability zero. However, one of these 
rrii constraints is guaranteed by normalisation. Hence 
each measurement Pi leads to m,; — 1 independent linear 
constraints on A. Hence we will generically require that: 

^2 m l -l <d 2 -1 

t=l,..,JV 



^2 m i < d 2 + N - ! 



t=l,..,JV 

In particular, if there are d measurement outcomes for 
each measurement, and if AT > 1, then this condition 
reduces to (for N = 1 the condition is trivial anyway): 

< d 2 - N.d + {N - 1) 
=> 0< (d - (N - l))(d - I) 
=> N <d+l 

The property of operator compatibility has important 
consequences if the measurements being considered arc 
tomographically complete, i.e. if the values of the proba- 
bilities of each measurement outcome arc sufficient to fix 
the state. In these cases a matrix is separable w.r.t. to 
the single party state space (as constructed via equation 
(|2J)) iff it satisfies a local hidden variable model: 

Lemma 3: A multiparticle state is separable 
w.r.t. the state spaces corresponding to tomo- 
graphically complete operator compatible mea- 
surements iff there is a local hidden variable 
model for those measurements made upon the 
state. 

Proof: We consider only two particles for simplicity, 
the argument straightforwardly to multiparticle systems. 
As the measurements are tomographically complete, this 



means that a state A fixes uniquely and is uniquely fixed 
by the traces of the matrix with the measurement op- 
erators. Hence the state is exactly characterised by the 
probability distribution of measurement outcomes. 

If the probability distribution satisfies a local hidden 
variable model [18|, then by definition it can be written 
as 

tr{A((P a )^(P b ) 3 )} = J2pWp(^X)q(j\b,X) (6) 

A 

where (P a )i and (Pb)j are the ith and jth elements of 
POVMs P a ,Pb respectively, p(X) is a probability distri- 
bution over a hidden variable, and p(i\a, A) and q{j\b, A) 
are probability distributions for local outcomes. These 
local probability distributions can in turn be decomposed 
as mixtures of local deterministic distributions, where a 
single deterministic outcome is specified for each mea- 
surement choice. The deterministic distributions in turn 
are matched by the extreme points of the state space (as 
defined via eq. (|2|)), and so a matrix A is separable w.r.t. 
the state space defined by the measurements iff it satisfies 
a local hidden variable model for those measurements. I 

In particular for the case of qubit Pauli measurements 
a quantum state is cube-separable iff it has a local hid- 
den variable model for Pauli measurements. If a set of 
allowed measurements is not 'operator compatible', then 
this opens up the possibility that there is a distinction 
between non-locality and non-separability - just as has 
been debated in the setting of conventional quantum the- 
ory [jj|. 

The connection between hidden variable models and 
non-classical computation is still an unresolved question 
(see e.g. [25j] for other investigations in this direction). 
The property of 'operator compatibility' sheds partial 
light on this question for machines restricted to Pauli 
measurements. As we can classically efficiently simulate 
any dynamics that preserve separability of the 'cubes' 
corresponding to the operator compatible measurements, 
we have that: 

Corollary 4: Quantum devices using only tomo- 
graphically complete operator compatible mea- 
surements cannot perform non-classical compu- 
tation if the quantum dynamics considered as dy- 
namics of the modified state spaces cannot gener- 
ate negative or non-local measurement probabili- 
ties (as computed via the Born rule) when acting 
upon states from the modified state space. 

Note that in this corollary we need to exclude the possi- 
bility of having negative probabilities because even single 
particle unitaries (not from the Clifford group) can rotate 
cube states to matrices that are not within the cubes. 

Lemma 5: Bell states of 2-qubits are cube sep- 
arable, hence there are quantum operations that 
are cube-separable, despite being entangling. 



Proof: It is well known that Bell states have a LHV 
model for A, Y, Z measurements (one way of proving this 
is presented later). Hence any quantum operation that 
creates Bell-diagonal two qubit states (e.g. by throwing 
the input away and creating a bell state in its place) will 
be separable in the cube setting, even though may be 
entangling in the usual quantum setting. ■ 

While Lemma 5 establishes a difference between cube- 
separable operations and entangling operations, opera- 
tions that create Bell diagonal states are entanglement 
breaking (they break entanglement between the two- 
qubits and the rest of the universe [19() and so they 
cannot be used to generate multiparty entanglement by 
themselves (and indeed they can be classically efficiently 
simulated \^). In Lemma 8 we will present examples 
of quantum operations that are cube-separable despite 
being capable of generating some form of multi-particle 
entanglement. However, first let us observe the following: 

Lemma 6: Around almost any quantum prod- 
uct pure state of many qubits, there is a region 
of cube-separable states. 

Proof: Consider a pure quantum state a that is not an 
cigenstate of the Pauli X, Y, Z operators. Such a state 
can be represented as a mixture of cube-pure state with 
strictly non-zero weights for all cube-pure states. Con- 
sider a small Hcrmitian perturbation A to products of 
such pure quantum states, to give an operator a + A. 
As cube-pure states form an overcomplete basis for the 
space, A can be expanded as a real combination of prod- 
ucts of cube-pure states, possibly with negative weights. 
However, if the norm of A is small enough, these nega- 
tive weights will be small enough not to take the product 
pure state a out of the cube separable states (each non- 
Pauli pure state lies on the Bloch sphere but is strictly 
inside the cube - sec Fig|T])). ■ 

A quantum state is said to be genuinely A-party en- 
tangled if it cannot be written as a probabilistic mixture 
of states that are products of entangled states of less than 
N parties. In the case of three parties, for example, it 
has been shown [20( that there are two classes of 3-qubit 
entanglement - the 'W and GHZ class entangled states, 
and that examples of such states can be found that are 
arbitrarily close to product states. Hence wc have the 
following observation: 

Corollary 7: There are genuinely multi- 
particle quantum-entangled states that are cube- 
separable. 

Proof: Both W and GHZ class 3-qubit states can be 
found that are arbitrarily close to product states. Hence 
the previous lemma shows that there are W and GHZ 
class entangled states that are cube-separable. ■ 

This argument suggests that it is quite likely that a 
variety of genuinely multipartite entangled states can be 
found that are cube separable, and hence have a LHV 



model for X,Y,Z measurements. However, as the number 
of qubits increases the size of the cube-separable ball will 
decrease. 

However, as we show in the next lemma, it is possible 
to construct noisy quantum gates that are not quantum- 
separability preserving (i.e. they can entangle product 
input quantum states), are not entanglement breaking, 
and that are not Clifford operations. The first two of 
these properties mean that these operations can act on 
systems of product input particles to generate entangle- 
ment of increasing numbers of particles. The final prop- 
erty means that these entangled states can be chosen to 
be non-Clifford. However, it is not yet clear to us whether 
the entanglement generated by these cube-separable but 
not quantum-separable operations can become arbitrar- 
ily long range, and this is something that warrants fur- 
ther investigation. It is important to note however that 
quantum Bell states are cube-separable, so they can in 
principle be included as a free resource to help generate 
multiparticle entanglement. 

Lemma 8: There are noisy quantum gates that 
are cube separability preserving but not (a) quan- 
tum separability preserving, (b) entanglement 
breaking, or (c) Clifford operations. 

Proof: There is a semi-systematic way of generating a 
family of quantum operations with this property. Con- 
sider the magic 'T-state' [1CJ - the qubit with Bloch vec- 
tor ^/l/3(l, 1,1). Let this state be denoted \T) and the 
orthogonal qubit state be denoted IT). Then we may con- 
struct a Choi-Jamiolkowski (CJ) [27| state for a 2-qubit 
quantum operation acting upon qubits A, B as follows. 
Let Al, A2 be the two qubits of the CJ state correspond- 
ing to qubit A, and Bl, B1 be the two qubits of the CJ 
state corresponding to qubit B. Consider CJ state made 
by mixing with probabilities 1/2 + e and 1/2 — e respec- 
tively the following two normalised states: 

(a\T A1 T A2 T B2 ) + f3\T A1 T A2 f B2 )) <g> I B1 /2 
{l\T A1 T A2 T B2 ) + 5\T A1 T A2 T B2 )) <g> I B1 /2 (7) 

where we abuse notation slightly to keep the notation un- 
cluttered - the first terms are written in 'coherent' form, 
whereas the state on particle Bl is written in operator 
form - ideally we should have written both as operators. 
However, it is not difficult to show that provided that 



1 



b + <0M 2 + ( 



i 



(8) 



that the reduced state of particles A1,B1 is maximally 
mixed. Hence the mixture is a valid CJ state for a trace 
preserving quantum operation [27[. But if both |a|,|7| 
are close enough to 1, the output state of this operation 
is close to a mixture of T A2 T B2 and T A2 T B2 , regardless of 
the input state. By linearity this extends to input cube- 
states that are not quantum - so for all input cube states, 
the output states will be close to a mixture of T A2 T B2 



and T A2 T B2 . This means that provided that we pick 
| a |, J7J close enough to 1 while maintaining condition (|5J) 
(which we can do provided that e is close enough to 0) the 
operation is cube-separable, because as explained earlier, 
states close enough to non-Pauli pure product states (and 
hence states close to mixtures of T^T^ and T A2 T B2 ) are 
cube-separable. The output qubit A2 is also always close 
to T A2 , and hence the quantum operation represented by 
equation ([7]) is not a Clifford operation, as together with 
the Clifford resources required to do teleportation it can 
generate T A2 . 

We would like to argue that the operations in equa- 
tion can be chosen such that they are neither entan- 
glement breaking nor separability preserving. They defi- 
nitely aren't entanglement breaking or separable because 
creating the CJ state involves generating entanglement - 
measuring T, f parity on A2B2 of the CJ state distills a 
GHZ-like state, similarly measuring parity on A1A2 does 
the same. Hence the operation is neither separable nor 
entanglement breaking as it is entangled across both the 
1 : 2 split and the A : B split. 

However, to ensure that the gates that we define do 
not fall under the regime covered by the Harrow & 
Nielsen algorithm [3j, we must also show that they are 
not separability-preserving, i.e. we must show that there 
are input product states that are entangled by the gate, 
even without the use of extra ancillas that would be re- 
quired to make the CJ state. It can be shown by ex- 
plicit calculation (using the PPT criterion [26|) that any 
product quantum input states with the A particle in 
l/y/2{\T) + \f)) A are taken to non-PPT outputs pro- 
vided that e > 0. ■ 



DYNAMICAL QUANTUM GATES AND 

NOTATION FOR REPRESENTING 

TWO-2LEVEL OPERATORS 

Most of the remaining sections of the paper will be 
concerned with determining the noise levels required to 
turn a CSIGN gate into a separable operation with re- 
spect to the modified notions of entanglement that we 
consider. In this section we lay out some of our notation. 
The same notation will be used both for the analysis of 
cube-separability, as well as the analysis of the rescaled 
Bloch-sphere that appears later. 

For our purposes the action of the CSIGN gates is most 
easily represented by its action on Pauli operators. It is 
hence convenient to represent the state of two cubes in 
the basis of products of Pauli operators. Let a — I, <j\ = 
X, a 2 = Y, and 03 = Z. Any two-cube operator can be 
represented as: 



A = y^ AjjOj <E> o-j 



(9) 



It is convenient to represent the set of coefficients Aij as 



a 4 x 4 matrix where the matrix elements refer to the 
coefficient of the Pauli operator expansion as follows: 

/ II IX IY IZ \ 

XI XX XY XZ 

YI YX YY YZ 
\ ZI ZX ZY ZZ ) 

e.g. the row 1, column 2 matrix element is the coefficient 
of the I <8> X operator in the expansion. For example, an 
explicit calculation shows that the quantum Bell state 
l/v^dOO) + 1 11) ) has the expansion: 

/ 1 0\ 
1 1 
4 0-10 

\00 1/ 

whereas a product state of two cube states is given by: 

/ 1 x 2 TJ2 z 2 \ 
1 X\ 

4 2/i • • 

V * • • ■ j 

where the "." elements are given by the product of the 
first element of the row, and the first element of the col- 
umn. - e.g. element (2,3) in the matrix is given by Xiy 2 . 
The "pure product" states in the cube picture are ones 
where all the x\, x 2l J/i, 2/2, ■• are either +1 or -1. In many 
cases we will ignore normalisation factors such as 1/4. 

This representation provides one way of seeing that 
the Bell state 1/V2(|00) + 1 1 1) ) is separable in the cube 
picture - it is given by a uniform average of all cube pure 
product states that satisfy the constraint x\ = x 2 ,y\ = 
~ 2/2) Zi = z 2 (this then extends to all the Bell states, 
because they are related to each other by Pauli unitarics 
on one side). 

Most of the calculations involving cubes in this pa- 
per will involve first ensuring that the output of a noisy 
CISGN leads to a valid general theory on the cube-states. 
The arguments proceed along the following lines. Sup- 
pose that you have a product of two cube-states input 
into a two-cube linear operation such as a CSIGN. If we 
want the theory to be self consistent, we have to make 
sure that the output state from this process gives a valid 
probability distribution for the set of allowed X, Y, Z 
measurements. A CSIGN acts in the following way upon 
a product input state: 





/ 1 A B C \ 




/ 1 zA 


zB C \ 


1 


x xA xB xC 


1 


xC yB 


—yA x 


4 


V yA yB yC 


^4 


yC -xB 


xA y 




\z zA zB zC J 




\ z A 


B zC ) 



On such an output state, what is the probability of mea- 
suring Pauli product P®Q, and getting say the outcome 
+1 for P and -1 for Ql Well we use the usual Born rule: 

1 



which is just given by an appropriate sum of four el- 
ements from the matrix of coefficients representing the 
output state. So if for example P = X and Q = X, then 
we just take a linear combination of the top left corner 
matrix elements: 



-(1 + xC- zA-yB) 



(11) 



:tr{((J + P)®(J-Q))p} 



(10) 



If the theory is self consistent, then we need all expres- 
sions such as this to be valid probabilities, and so they 
cannot be negative. In Appendix II we show that without 
adding noise, the CSIGN gate acting on any state spaces 
larger than the Bloch sphere leads to negative values for 
such expressions. As cube-separability implies positive 
probabilities for Pauli measurements, this means that the 
noiseless CSIGN cannot be a cube-separable operation. 



NOISE MODELS AND CORRESPONDING 
BOUNDS FOR CUBE-SEPARABILITY. 

The arguments of Appendix II show that a CSIGN gate 
cannot be cube-separable, as it does not even lead to pos- 
itive output states (for the allowed Pauli measurements) 
when acting upon input product cube states. However, as 
we shall see, it can often be made cube separable by the 
addition of noise. In this section we consider a few stan- 
dard noise models on the CSIGN gate, and bound or com- 
pute the noise levels required to make it cube-separable. 
One might hope that because of example ([7]), there may 
be situations involving natural noise models for which the 
consideration of cube-separability may lead to tighter up- 
per bounds on non-classical computation thresholds. 

However, we shall see that in all the cases considered 
in this section the upper bounds derived are either the 
same or worse than those that can be derived by con- 
sidering quantum separability, or have not been proven 
to be lower by us. In fact for magic state architectures 
(which were the primary motivation for considering the 
cubes anyway) better bounds can in many cases be ob- 
tained using the GK theorem J32J . 

To show that a given (noisy) gate is cube-separable, 
one would have to show that all possible combinations 
of input product pure cube states (the corners of the 
cubes) are taken to output states that are cube-separable. 
However some of the noise models that we will consider 
have certain symmetry properties which mean that it is 
sufficient to consider only one input cube pure product 
state. In particular, suppose that we have a CSIGN gate 
U acting on a pair of pure cube states p and some CP 
map error E acting with probability A, such that under 
the total action of the noisy gate the output state is cube- 
separable: 

(1 - X)U(p) + XE(U(p)) e cube - separable 

Now suppose that E is such that it commutes with any 



local Pauli product unitary transformation Q, as well lo- 
cal products of phase gates S := |0)(0| + i|l)(l|, both 
acting by conjugation. Then it will be the case that: 

(1 - X)QU{p)Q f + XE{Q(U{p))Q f ) e cube - separable 
(1 - \)SU(p)S* + XE(S(U{p))S^) G cube - separable 

because both Q and S are Clifford unitaries and hence 
preserve the cubes. Now we can move Q through the 
CSIGN U to get another Pauli transformation P, and 
we can move S through the CSIGN simply because it 
commutes: 

(1 - X)U(PpP f ) + \E(U(PpP^)) e cube - separable 
(1 - X)U(SpS^) + XE(U(SpS^)) e cube - separable 

where P is some other Pauli product. Now by choosing 
sequences of P gates and S appro pria tely, we can take an 
cube extremal state to any other [33J . This hence means 
that if A is strong enough to make one particular input 
cube pure product state separable, it can make any cube 
state separable, provided that E satisfies the required 
commutation properties. If E corresponds to local or 
joint dephasing or depolarisation noise (as defined later), 
then it has these properties, and so for those noise models 
we shall only consider what happens to one particular 
product input cube-state. 

Despite this simplification the problem is still not easy 
in general, because deciding whether there is a LHV 
model for a given joint probability distribution is not 
straightforward [31[. In the relevant case of 2 parties, 3 
measurements, and 2 measurement outcomes, a complete 
but large list of Bell inequalities has been obtained [28| . 
However, we shall find that for most of the noise mod- 
els that we examine, the problem can be completely or 
almost completely solved analytically - in fact in most 
cases the key constraint is not one of locality, but of hav- 
ing a valid probability distribution for the outcomes. 

• Error-per-Gate noise on 2 particles under- 
going CSIGN. The minimal error rate A re- 
quired satisfies 50% > A > 20%. 

In this noise model we apply a perfect CSIGN and 
then afterwards with probability p apply a general 
quantum CP map of our choosing - we may be as 
adversarial as possible within the set of two-qubit 
CP maps. It can be seen that p =50% noise is 
sufficient. To see that it is sufficient, consider de- 
phasing one particle completely prior to performing 
the CSIGN gate - this corresponds to 50% Z noise 
on one arm. This takes that particle to a mixture 
of |0) and |1), which means that the second particle 
will be probabilistically rotated by Z. As Zs pre- 
serve the cube this means that the overall operation 
will be cube-separable. 

We may also show that at least 20% noise is nec- 
essary using the following argument. Consider the 



cube-extremal state given by Bloch vector (1,1,1). 
We may write this state as a linear combination 
of the density matrices T and T, which we define 
as the density matrices corresponding to the magic 
states |T) and \T) respectively. The expression is: 




= wT — (w — 1)T 

where wc define w := % ■ Similar expressions 
hold for the other cube cxtrcma, albeit with differ- 
ent magic states. So the tensor product of two of 
cube cxtrcma will be of the form: 



w 2 T A 



-w(w — 1)Ta 



Tb — w(w 



Iff A , 
- l)f A < 



T B 
Tb 



for two choices of magic state Ta , Tb ■ Hence any 
CP map acting upon the product of cube extrema 
inputs will give an expression of the form: 

w 2 pi + (w - l) 2 p2 - w(w - l)p 3 - w(w - l)p4 (12) 

where p\, .., pi are quantum states. Note that both 
w and w — 1 are positive, and so the maximum 
'probability' for a measurement outcome on equa- 
tion (|T2"j) will be no greater than: 



v? + (w - l) 2 = 2 



(13) 



Equation (fTTj) with the choice x = y = z = A = 
B = —C = 1 gives a negative probability of -1/2 for 
the pure CSIGN, so at the very least we need the 
noise to be sufficiently strong to make this positive. 
Now as the noise can at most balance this out by 
mixing in a 'probability' of w 2 + (w — l) 2 = 2, we 
require that: 



-1 



(i-^i — )+A(2)>0 



A > - = 20% 
5 



We have not been able to obtain an exact value for 
the error-per-gate noise required to make CSIGN 
cube-separable. However these values should be 
compared to the noise required for the CSIGN to 
become quantum-separable, which is 50% [3j. 

• Joint depolarising on output. A noise level 
of A > 2/3 is necessary and sufficient. 

Joint depolarising noise on a 2-cube operator acts 
is parameterised by a probability A and acts as 
p — > (1 — X)p + Atr{p}J/4 immediately after the 
action of an ideal CSIGN. In terms of the matrix 
representations that wc have been using above, this 



transformation is represented as A o — > A o , other- 
wise Aij^oo — >• (1 — X)Aij. It can be shown that 
A = 2/3 noise is both necessary and sufficient. 
Necessity follows from equation (fTTj) : picking in- 
put cubes with x = —C,z = A,y = B and then 
applying the depolarising noise, we find that re- 
quirement (fTTj) is equivalent to 1 — 3(1 — A) > 0, 
and hence A > 2/3. That this level of noise 
leads to a system with a LHV is shown in the ap- 
pendix by considering instead an input state where 
x = y = z = A = B = C = l. 

This is essentially the same as ordinary quantum 
case - as 2/3 total joint depolarisation is required 
to take an EPR pair to a separable state. Hence 
the consideration of cube separability leads to no 
reduction in the upper bounds. 

• Local depolarising noise. An error rate of 

p > 2 — y/2 ~ 60% is necessary and sufficient. 

Local depolarising of strength A acts as p — > (1 — 
\)p + Atr{p}//2 independently on each particle. 
In terms of the Pauli coefficients Ay this corre- 
sponds to A 00 -> A Q0 , A Qj -¥ (1- p)A 0j for j ^ 0, 
Aio -> (1 - p)A i0 for i ^ 0, and Ay ->■ (1 - p) 2 A l3 
otherwise. Applying condition (|TT1) for CSIGN act- 
ing upon input pure cube states with x = — C, z = 
A,y = B followed by such noise gives 

1 - 2(1 - p) - (1 - pf > => p > 2 - V2 - 60% 

That this level of noise leads to a system with a 
LHV is shown in the appendix by considering in- 
stead an input state where x = y = z = A = B = 
C = 1 - the relevant LHV is the last one presented 
in the appendix. 

This figure is worse than the local depolarising rate 
required to turn an arbitrary entangled pure 2 qubit 
state into a separable state, which using the PPT 
criterion [26j can be easily shown to be p = 1 — \= = 
42.2%. 

• Local dephasing noise. An error rate of p > 
1 h= ~ 30% is necessary and sufficient. 

Local dephasing of strength p implements a Z with 
probability p on each particle independently. In 
terms of the matrix representation that we have 
been using above, this is equivalent to Ay — > Ay 
whenever i G {0,3} and j € {0,3}, Ay —} (1 — 
2p) 2 Ay whenever i <G {1,2} and j € {1,2}, and 
otherwise Aij — > (1 — 2p)Ay. Applying condition 
dTTJ for CSIGN acting upon input pure cube states 
x = —C,z = A,y = B followed by such dephasing 
noise hence leads to the condition: 



That this level of noise leads to a system with a 
LHV is shown in the appendix by considering in- 
stead an input state where x = y = z = A = B = 
C = 1 - the relevant LHV is the penultimate one 
presented in the appendix. 

Unfortunately the value of p > 1 4r ~ 30% 

is no tighter than can be obtained by considering 
quantum-separability: it is precisely the same local 
dephasing rate as is required make the CSIGN a 
quantum-separable operation (we omit the details 
to show this, but it can be shown by considering the 
C J state of the noisy operation, which turns out to 
be locally equivalent to a separable Bell diagonal 
state in an encoded basis |0) = 1 00) , |1) = |11)). 



NOISY MEASUREMENTS OR PREPARATIONS: 
RESCALED BLOCH SPHERE. 

An alternative scenario (and a very realistic one) in 
which measurements can be restricted is when they are 
noisy. In this section we shall find that admitting noise 
on the measurements allows the CSIGN gate to become 
separable (with respect to the redefined state space) with 
slightly less noise than is required for quantum separa- 
bility. The differences are slight (Figs.©,©), but do re- 
veal some interesting behaviour. Firstly, one can consider 
adding noise to preparations instead of measurements, 
but we find that in contrast more noise is required. Sec- 
ondly, in the case of local dephasing noise we find that 
the noisy CSIGN cannot be a valid dynamical operation 
(in a generalised theory sense) for the non-quantum state 
spaces unless the dephasing is maximal. 

First let us establish the state spaces that we consider. 
Consider a given projective measurement {|e)(e|, |e)(e|}, 
and suppose that immediately prior to it the particle be- 
ing measured is depolarised with probability p. The fact 
that the measurement is noisy means that a longer Bloch 
vector can be admitted than in the noise free setting. A 
qubit matrix p (perhaps from outside the Bloch sphere) 
will be admissible for a noisy version of the measurement 
if 

P 



(l-p)tr{|e)(e|p} + ^>0 
(l_p)tr{|g><e|p} + J>0 



(14) 



1 - 2(1 - 2p) - (1 - 2pf > ^> p > 1 



1 
V2 



30% 



for all possible measurement directions. In the case of 
qubits this condition translates to the requirement that 
the Bloch vector v representing the state p be valid in 
the noisy setting if 

v = Rv' (15) 

defines a valid vector v' in the noise-free setting, where 
R is given by: 

1 



R 



1-p 



(16) 
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This simply means that the new noisy valid set is equal 
to the noise-free valid set enlarged by a factor R > 1. 
Actually the rescaling factor R can be used to describe 
noise in the preparation of qubits too: if we have R < 1 
then this corresponds to preparing qubits and then de- 
polarising them individually at a rate 1 — R (i.e. p — > 
Rp + (1 — R)I/2). Hence we have two types of noisy 
modifications of the Bloch sphere: if R > 1 we interpret 
this as depolarising noise just before the measurements 
at rate p = (R— 1)/-R, if R < 1 we interpret this as de- 
polarising noise in the qubit preparations at rate 1 — R. 
It will be convenient for us to use the following notation: 

Definition: Rescaling map. Let Tr denote the lin- 
ear map corresponding to rescaling parameter R. Pro- 
vided that R > this map is invcrtible, and we denote 
the inverse by T^ 1 

With these modifications of the single particle state 
space, our goal is to calculate the amount of noise re- 
quired before an entangling gate such as the CSIGN C be- 
comes non-entangling in the modified picture. The PPT 
criterion can be applied to answer this question, because 
a two-particle operator AB will be separable w.r.t. to 
an R rescaled single particle space iff T^ 1 (g> T^ l (AB) is 
both positive and PPT. Hence we need to work out the 
minimal noise N required before all the expression 



r, 



•T R l oNoCoT R ® T R (<j> ® if>) 



(17) 



is both positive and PPT for all possible input pure quan- 
tum product states 4> <E) ip- 

Appendix III gives some symmetry arguments that 
simplify this problem for local and joint depolarising 
models, and for local dephasing models. In all these cases 
numerical search enabled us the minimal noise to make 
the outputs positive and PPT. We expect from the nu- 
merics that the choices for cf> and ip that need the most 
noise to give output PPT and positive states can be cho- 
sen to be either 1/2(7 + X) or 1/2(7 + Z), however we 
have not yet been able to confirm this analytically. 

• Joint Depolarising. We numerically obtained 
the graphs Fig((2]) and FigQ for the tradeoff be- 
tween rescaling and noise on the CSIGN required to 
ensure rescaled-separability. The graphs show that 
at a rescaling of R = 1.73, corresponding to a lo- 
cal depolarisation rate of approx 42.2% before mea- 
surements, joint depolarisation rates on the CSIGN 
of 53.6%, are sufficient to ensure rescaled separabil- 
ity. Hence despite the fact that the noisy CSIGN 
gates are slightly entangling, the noise on the mea- 
surements is sufficient to eradicate any effective en- 
tanglement. 

• Local depolarising. We numerically obtained 
the graphs FigQ and Fig© for the tradeoff be- 
tween rescaling and noise on the CSIGN required 



1.5 
R 



FIG. 2: Joint depolarisation noise required to remove CSIGN 
entanging power w.r.t. rescaled Bloch spheres. R — 1 corre- 
sponds to no rescaling. We see that adding noise to the mea- 
surements in the device (R > 1) allows the CSIGN gate to 
become separable w.r.t. the rescaled state space with slightly 
less noise. 
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FIG. 3: Joint depolarisation with rescaled Bloch sphere. Here 
FigQ is replotted with R converted to the equivalent depo- 
larisation rate on either preparation (R < 1) or measurement 
(R > 1). The curve that starts lower corresponds to R > 1, 
the one starting higher to R < 1. The vertical axis labels the 
noise required on the CSIGN gate to remove its entangling 
power w.r.t. to the rescaled Bloch sphere. 



to ensure rescaled-separability. The graphs show 
that for rescaling of R = 1.16, corresponding to 
a local depolarisation rate of approx 13.8% be- 
fore measurements, a local depolarisation rate of 
39.5%, a is sufficient to ensure rescaled separabil- 
ity. Hence despite the fact that the noisy CSIGN 
gates arc slightly entangling (one can see from the 
figures that the gate noise is lower than the value at 
R — 1), the noise on the measurements is sufficient 
to eradicate any effective entanglement. 

• Local dephasing. Interestingly it can be shown 
that for R ^ 1 only complete local dephasing (i.e. 



11 



is easy to diagonalisc. Two of the eigenvalues are: 




FIG. 4: Local depolarisation noise required to remove CSIGN 
entanging power w.r.t. rescaled Bloch spheres. R — 1 corre- 
sponds to no rescaling. We see that adding noise to the mea- 
surements in the device (R > 1) allows the CSIGN gate to 
become separable w.r.t. the rescaled state space with slightly 
less noise. 



tfrvcv.'Vii'; //«;;/ O.yviW, 



FIG. 5: Local depolarisation with rescaled Bloch sphere. Here 
FigQ is replotted with R converted to the equivalent depo- 
larisation rate on either preparation (R < 1) or measurement 
(R > 1). The curve that starts lower corresponds to R > 1, 
the one starting higher to R < 1. The vertical axis labels the 
noise required on the CSIGN gate to remove its entangling 
power w.r.t. to the rescaled Bloch sphere. 



applying local Zs with independent 50% probabil- 
ity on each side) is sufficient to make the CSIGN 
separable w.r.t. rescaled cubes. If we choose 
(f> = 1/2(1 + X) and ip = 1/2(1 + Z) we find that 
the output of equation (|17[) is proportional to (we 
drop normalisation): 



J <g) J+ (1 - 2p)RX <E>I + I<E)Z+ ±- ^-X <E> Z 

R 



where p is the probability of locally applying a Z 
operator. All terms in this operator commute, so it 



(l-2p)(R--) 

(i-2 P )(i-ii) 



(18) 



which cannot both be positive if p ^ 1/2 unless 
R = 1. This is very much analogous to our later 
findings for local dephasing on rescaled cubes. In 
fact, the positivity of equations (|T5|) is necessary 
just for positivity for the output. Hence for R^\ 
we cannot even have a valid probabilistic theory 
unless the local dephasing on the CSIGN is total. 



RESCALED CUBES. 

The tradeoff investigated in the previous section can 
also be explored in the situation where measurements are 
both restricted in direction and noisy. In magic-state ar- 
chitectures it could for example be the case that a small 
amount of noise on the measurements or preparations 
could reduce the noise required to make the CSIGN gate 
separable w.r.t. the rescaled cubes. The noise models 
that we consider in this section are mixtures of Pauli 
operators, so by the arguments presented earlier it is suf- 
ficient to consider only one such input state. 

There are two values of the rescaling parameter that 
have particular significance: values of R > l/v2 repre- 
sent a rescaled cube that contain all the privileged pure 
'magic states' along the 'T' and 'H' directions [10(, and 
values of R > 1/V3 represent a rescaled cube that con- 
tain all the pure magic states in the 'T' directions. Hence 
we will pay special attention to R values of I/a/2 and 
1/VS. A rescaling value of R — 1/V3 also has further 
significance: it represents the local depolarising rate re- 
quired to disentangle (in the usual quantum setting) a 
Bell state. Hence as pure Bell states are cube-separable, 
this means that for 1 > R > l/v3 our device can in prin- 
ciple also have access to entangled Bell diagonal states. 



Local Depolarisation for rescaled cubes 

To analyse local depolarisation of rescaled cubes con- 
sider the sequence of transformations: (A) a rescaling by 
R > (corresponding to noise on the measurements or 
preparations), (B) followed by a CSIGN, (C) followed by 
a local noisy depolarisation operation (characterised by 
a rescaling r < 1), (D) followed by undoing the rescaling 
R. Let us pick two input pure cube states with Bloch 
vectors (x,y,z) and (A,B,C) to go into this sequence 
of transformations. This gives the following sequence of 
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operators: 



/ 1 
rR 2 xC 

rR 2 yC 

rRz 



/ 1 RA RB RC \ 

Rx R 2 xA R 2 xB R 2 xC 

Ry R 2 yA R 2 yB R 2 yC 

\ Rz R 2 zA R 2 zB R 2 zC ) 

( 1 R 2 zA R 2 zB RC \ 

R 2 xC R 2 yB -R 2 yA Rx 

R 2 yC -R 2 xB R 2 xA Ry 

\ Rz RA RB R 2 zC J 

rR 2 zA rR 2 zB rRC \ 
r 2 R 2 yB -r 2 R 2 yA r 2 Rx 
-r 2 R 2 xB r 2 R 2 xA r 2 Ry 



'-RA 



r 2 RB r 2 R 2 zC j 



( 1 tRzA 
rRxC r 2 yB 
rRyC —r 2 xB 



rRzB rC \ 

-r 2 yA r 2 x/R 



l xA 



2 y/R 



V 



l A/R r 2 B/R r 2 zC J 



Our goal is hence to work out for a fixed value of R 
the maximum r (i.e. the minimal local dcpolarisation) 
required before the last state in the sequence is cube sep- 
arable for a given choice of x, y, z, A 7 B 7 C G ±1. Let us 
pick x, y, z, A,B,C = +1. The final state in the sequence 
becomes 



rR 

,.2 



rR 



l /R 



\ 



( i 

rR 
rR 
\ r r 2 /R r 2 /R r 2 / 



(19) 



Again we need to make sure that all probabilities are pos- 
itive. We numerically identified the following two prob- 
abilities as being important. Consider the probability of 
getting the down outcome on both sides upon measuring 
X®Y: 



-(1 -r 2 -2rR) > 



\J\ + R 2 - R > 



(20) 



and the probability of getting the up on the first particle 
and down on the second particle upon measuring X g> Z: 



^ 



rR 



r . 
R^° 



R-l + ^jR-lf+i/R 
(2/7?) 



> r 



(21) 



Both inequalities are plotted in figure ([6]). The first 
bound decreases monotonically as R increases. Hence 
we know that choosing R > 1 cannot lead to a tight- 
ening of the bound that we have already obtained for 
R = 1. Hence we may restrict our attention to rescaling 
factors corresponding to R < 1. The peak r that satisfies 
both these inequalities corresponds to the intersection 
of the two curves, and is given by 1 — r = 39.2919%, 




FIG. 6: The horizontal axis corresponds to R, the vertical 
to r. The decreasing curve corresponds to equation (|20[) . the 
increasing curve to (|21|) . The valid region must lie below 
both these curves - above them gives negative probabilities of 
outcomes. The dotted line is the line R = r. 



at 1 — R = 47.9927%. However, the maximal r local- 
depolarising level at which the CSIGN is still quantum- 
separable is 1 — r = 1 7?— 42.2649%. Hence even if 

1 — r = 39.2919% could correspond to a valid LHV model 
(which we do not believe from numerical investigations) , 
the reduction in the fault tolerance upper bound would 
be marginal and comes at the cost of a fair amount of 
noise on the qubit preparations. 



We in fact believe from numerical investigation that 
equation (|20|) corresponds to a valid LHV model only for 
R >ss 0.5449335, corresponding to an r value of 1 — r = 
40.6095268%. However the R values of 1/V2, l/VS are 
covered by this region. At R = y/1/2 « 0.7071 we re- 
quire a local dcpolarisation rate of 1 — r = 1 — (V3 — 
1)/V2 « 48.24%, and at R = ^/T/3 w 0.5773 we require 
a local dcpolarisation rate of 1 — r = 1 — l/v3 « 42.27%. 
This last figure means that a stabilizer based computa- 
tion with access to pure magic states in the T-direction 
becomes classically simulatable if the CSIGN gates and 
Pauli qubit preparations are subject to 42.27% local de- 
polarisation. This figure is effectively worse than bounds 
obtainable from ordinary quantum separability: at local 
depolarisation rates of 1 — r = 1 — l/VS the CSIGN al- 
ready loses the ability to quantum entangle input qubit 
states. 
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Joint Depolarisation for rescaled cubes 

For a total joint depolarising noise model the equiva- 
lent of equation (|19[) is: 



(22) 



( 1 rR rR r \ 

rR r —r r/R 

rR —r r r/R 

\ r r/R r/R r I 



The probability of measuring down, down for a measure- 
ment of X <£> Y is given by: 



1 



(1 



^> r < 



2rR) > 
1 



2R+1 



(23) 



Numerical investigations helped us to identify this as a 
key inequality. Another useful (but not as important) in- 
equality is the probability of getting up, down upon mea- 
surement of Y (g> Z . This is given by: 



1 



(1-r- (r/R)+rR) > 
1 



^ r < 



1 + (1/i?) - R 



(24) 



Both these inequalities are plotted in ([7]) . Equation 
shows that we may only reach cube-separability with less 
noise if R < 1. Numerics show that the inequality (|23|) 
is achievable by a LHV model for values of R down to 
l-ii = l-4=w 29.29%, i.e. a noise value of: 



1 



=>A = 1- 



/2 + 1 



\/2+l 

58.58% 



(25) 



This value of R means that all pure magic states are 
admitted. Hence for a stabilizer based computation with 
access to pure magic states, a local depolarisation rate on 
Pauli preparations of 29.29% and a joint depolarisation 
rate of 58.58% on the CSIGN gates are sufficient to make 
the device efficiently classically simulatable. We can also 
allow the machine access to slightly entangled noisy Bell 
states: as Bell states are separable w.r.t. unsealed cubes, 
Bell states locally depolarised at a rate of 1 — R will be 
separable w.r.t. the re-scaled cubes. However, local de- 
polarisation rates of 1 — l/\/2 arc not sufficient (1 — 1/V3 
is required) to make a Bell state quantum separable. 




FIG. 7: The horizontal axis corrpesponds to R, the vertical 
to 1 — A = r, where A is the joint depolarisation rate. The 
decreasing curve corresponds to equation (|23l) . the increasing 
curve to (|24[) . The valid region must be below both these 
curves - above them gives negative probabilities of outcomes. 
The intersection point of the two curves is not achievable by 
a LHV model according to numerical investigations. The in- 
tersection point is given by (R,r) — (l/v 7 ^, v3/(2 + \/3)) ~ 
(0.58,0.46) 



of equation (|19[) is: 

/ 1 (l-2p)R (l-2p)R 1 \ 

(l-2p)R (l-2p) 2 -(l-2p) 2 (l-2p)/R 

(l-2p)R -(l-2p) 2 (l-2p) 2 {l-2p)/R 

1 (l-2p)/R (l-2p)/R 1 



V 



There is no LHV model that can match such a state 
- any product decomposition must have only Is in the 
corner elements, but then the (1,2) and (4,2) elements 
must be identical, which they are not if R ^ 1, unless the 
dephasing is total (i.e. p = 1/2). Another way of seeing 
this is that for measurements of Z <Ei X giving down, up 
or down, down the probabilities arc given by: 

^(l-l + (l-2p)R-(l-2p)/R)>0 (26) 

1(1 - 1 - (1 - 2p)R + (1 - 2p)/R) > (27) 

These inequalities cannot be satisfied if R ^ 1 unless the 
dephasing is total (i.e. p = 1/2). As these are conditions 
for a probability distribution, this means that no valid 
generalised theory can be constructed on R ^ 1 re-scaled 
Bloch-cubcs for a CSIGN with any partial amount of 
local dephasing. 



Local dephasing for rescaled cubes. 



CONCLUSIONS AND DISCUSSION. 



It turns out that for any R^\ the noisy CSIGN fails 
to be separable w.r.t. the rescaled cubes. The equivalent 



We have explored the consequences of redefining the 
single particle state space as the set of normalised 
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positive operators with respect to restricted measure- 
ments. This modification allows the consideration of non- 
quantum states and a different notion of entanglement, 
which can then be fed into classical simulation methods 
for noisy quantum computers that rely on limited entan- 
glement, such as |3|. 

Measurements restricted to Pauli axes give a 'cube' 
state space. We find that it is possible to write down 
noisy quantum gates that are (a) quantum-entangling 
and (b) non-Clifford operations, but are separable with 
respect to the modified cube state space. Hence the 
method appears to provide efficiently classically simulat- 
able regimes that are not covered straightforwardly by 
existing methods (all qubit states, including non-Clifford 
ones, are contained within the cube). The operations 
represented by the Choi-Jamiolkowski states in equa- 
tion ([TJ are also not entanglement breaking, and so they 
also can certainly generate multiparticle entanglement. 
However, it may be the case that repeated application 
of such operations cannot generate arbitrary long-range 
quantum entanglement. The precise differences in multi- 
particle quantum-entangling abilities of cube-separable 
operations are something that warrant further investiga- 
tion. Bell states are cube-separable and so in principle 
they can be included as a resource in the device, and this 
may help give the device an ability to generate multiparty 
entanglement. 

The approach has strong links to studies of gener- 
alised theories and the issue of non-locality and non- 
contextuality versus efficient classical simulation. Indeed 
most of the technical calculations presented in this paper 
can be seen as attempts to construct a non-quantum the- 
ory where the dynamics are taken from noisy quantum 
theory but the single particle states spaces are different to 
quantum. For some noise models we find that we cannot 
modify the quantum state space for anything less than 
maximal noise on the quantum dynamics (e.g. for the 
locally dephased CSIGN gate). 

One motivation for this work was to understand 
whether this approach can be useful for deriving up- 
per bounds to noisy non-classical computation thresh- 
olds. In relation to cube-separability, examples such as 
the operations represented by equation ([7]). and the fact 
that pure Bell states are cube-separable, suggest that for 
some noise models it should be possible to obtain tighter 
bounds. However, for machines restricted to Pauli mea- 
surements, and for the natural noise models that we have 



considered, there is little or no demonstrable reduction in 
the threshold upper bounds that can be obtained via the 
HN method or application of the Gottesman-Knill the- 
orem. Nevertheless, the approach may be of conceptual 
interest as it provides an alternative view of the non- 
classicality of magic-state quantum computers - instead 
of magic-state or conventional quantum entanglement be- 
ing viewed as the non-classical resource, the ability to 
generate non-local correlations or non-positive dynamics 
of the cubes is seen as essential to the non-classicality. 

In the case where measurements arc not restricted to 
a particular direction but they are slightly noisy, we 
have obtained curves showing the tradeoff between noise 
on measurements and the gate noise required to make 
the CSIGN separable with respect to the rescaled Bloch 
sphere. In these cases we do observe that adding small 
amounts of dcpolarisation noise to the measurements 
does require less noise on the CSIGN gates to induce 
separability, although the effect is slight (e.g. Fig. ([3] 
Fig.©)). However it is interesting to note that adding 
noise to qubit preparations does not have the same effect, 
in contrast it requires more noise on the CSIGN gates. 
It is also interesting to note that for rescaling factors 
R 7^ 1 a locally dephased CSIGN gate cannot give dy- 
namics in a valid generalised probabilistic theory unless 
the dephasing is maximal. 

Many of the questions raised can be naturally gen- 
eralised to higher dimensions: our restricted measure- 
ment sets have been constructed by selecting a single 
measurement and generating a full set using a group of 
single particle unitaries, and this process can extend nat- 
urally to higher dimensions. In particular when the sets 
of measurements are both tomographically complete and 
'operator compatible' it will also hold that non-classical 
computation requires the ability to generate non-local 
correlations. This applies to all fault tolerance schemes 
that use only generalised Pauli measurements in higher 
dimensions. 
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APPENDIX: RELEVANT LOCAL HIDDEN VARIABLE MODELS. 

In this section we mostly ignore normalisation factors, and represent product cube states by notation such as: 



1 . . . 

1 . . . 

\l . . .) 



(28) 



where the dots "." indicate that the element is the product of the element at the beginning of the row and top of the 
column. We construct a number of LHVs in sequence. Later LHVs often are built using earlier ones. Most of these 
models were developed by trial and error in order to match the inequalities needed for the output of the noisy CSIGN 
gate to give valid probabilities for Pauli measurements. 
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4. Joint depolarising noise LHV: 
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5. LHVs required for later local dephasing model: 
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6. LHV for local dcphasing. The following is a valid LHV provided that 1 — 2(1 — 2p) — (1 — 2p) 2 > is satisfied: 



/ 1 



2p 
2p 



(1 
(1 




(1 - 2p) 



2P) 2 ) 



(i - 2 P y 



/ 1 1 \ 
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7. LHV for local depolarising. The following is a valid LHV provided that 1 — 2(1 

1 

„-i2 (^ „\2 f-l „\2 

-P)-(1-Pf) 



((l-p)-(l-^) 




Z 1 



1 



p) 2 > is satisfied: 

0\ 





3(1 -py 



fj3 

1/3 
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1/3 

1/3 
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0/ 

1/3 1/3 \ 
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1/3 1/3 

1/3 1/3/ 



APPENDIX II: NOISELESS CSIGN IS NOT A CUBE-POSITIVE OPERATION FOR STATE SPACES 

BIGGER THAN THE BLOCH SPHERE. 

Let us pick input pure cube states x = 1,(7 = — l,z = A = l,y = B = 1 for a noiseless CSIGN gate. Applying 
this choice to equation (jlip gives -1/2, which is not a valid probability. So the theory is not self consistent if the 
CSIGN is admitted as a 2-cube operation acting on all cube states. This tells us that we would need to add noise 
to the CSIGN not only to make the output cube-separable, but even just to make the output correspond to a valid 
probability distribution when measured using Pauli operators. 

In fact the same argument can be made to go a little further. One might consider more general situations in which 
other sets of measurements (in addition to the X, Y, Z measurements) are used to define the set of allowed operators - 
perhaps by picking an additional projective measurement in some direction and considering what measurements would 
be obtained by allowing the Clifford group (or another group containing it) to act upon it by conjugation. This group 
could then be the allowed single qubit unitaries in a proposal for a quantum computer. The set of measurements 
obtained using this group would then define a set of 'states' that encloses the Bloch sphere. However, as the following 
argument shows, if this set of states is required to (a) contain the Bloch sphere within it, and (b) Pauli measurements 
are allowed, then the only consistent set allowing the CSIGN gate is the Bloch sphere itself. Equation (fTTj) gives the 
requirement: 



\* 



cC -yB- zA) > 



(29) 



as this needs to be a valid probability. But let us fix V = (C, —B, —A) to be any unit vector from the Bloch sphere 
(as our set of allowed states contains the Bloch sphere, we can do this, as A, B, C can be the components of any unit 
vector). Denoting v = (x,y,z), we have: 



1 + v.V > 



(30) 



But the only way that this can be true for all possible choices of V is if v is a vector of less than unit norm, i.e. the 
Bloch sphere. Hence we see that the CSIGN gate cannot output only positive states for any set of single particle 
states that is strictly bigger than the Bloch sphere, and hence cannot even be separable. 

APPENDIX III: RESCALED BLOCH SPHERE SYMMETRY SIMPLIFICATIONS. 



Consider the following sequence of transformations: (A) pick two pure quantum states from the Bloch sphere ift 
and </>, (B) rescale by a factor R > in accordance with the noise on the measurements or preparations, (C) apply a 
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CSIGN gate C, (C) apply the noise model N under consideration, (D) apply the inverse of the rescaling. The total 
sequence is: (T^ 1 (g) T^ 1 ) o N o C o (Tr ® Tft)(ip g) 0). Local unitaries after this transformation will not affect the 
positivity or PPT characteristics of the output. 

Let us consider applying local rotations about the Z axis on each particle. All noise models that we will consider 
commute (as CP maps) with the application of rotations about the Z axis - we will only consider local depolarising, 
dephasing, or joint depolarising. As local rotations about the Z axis also commute with T^ , Tr, and the CSIGN 
gate, this means the rotations can be pushed all the way through onto the input states. Hence we are free to pick ip 
and <f> up to arbitrary rotations about the Z axis. We will hence use this freedom to choose input pure states with no 
Y component. 

Under the noise models each Pauli product term in the expansion will acquire a factor < r < 1, which can in 
principle vary for different Pauli terms. However, for the purposes of the following argument we may simply set it 
to be constant, as it would be for the joint depolarisation noise model. Then the sequence of transformations will be 
represented as: 

/ 1 Rcos(cj)) i?sin(» ^ 

Rcos(6) R 2 cos(0) cos(0) R 2 cos(8) sin^) 



\Rsm{9) R 2 sin(0) cos(0) R 2 sin(0) sin(0) / 



— > 



t 1 

R 2 cos(0) sin(0) 

\ i?sin(6>) 

rR 2 cos(9) sm(cj)) 

\ rRsin(9) 

( 1 

rR cos(0)sin(0) 


\ r sin(0) 



R 2 sin((9) cos(0) 







Rcos((j)) 

rR 2 sin(0) cos(0) 









R 2 cos(0) cos(» 



Rsm(cj)) 

Rcos(9) 





\ 







R 2 sin(0) sin(^) / 



rR 2 cos(9) cos((/)) 



rR sin(0) 

rRcos(9) 





\ 



rRcos(cj)) 

rRsm(9) cos((j>) 





rR 2 sin(0) sin(</>) J 



r sin(0) 
icos(0) 



\ 



r cos(9) cos((f>) 

■^ cos(0) r sin(0) sin(^) y 



Partial transposition on the second system applies a minus sign to the third column, and so our goal is to show that 
for both choices of sign the operators: 



\ 



/ 1 ri?sin(6') cos(^) rsin(^) 

rRcos{9)sm((/)) ^ cos(0) 

±rcos(0)cos(0) ^ 

\ rsin(0) -^ cos(4>) r sm(9) sin(^) J 

are positive. A few observations concerning symmetries arc helpful in this regard: (a) adding 7r to 9 changes the sign 
of the sin(0) and cos(0). But it can be verified that this can also be achieved by applying a final local unitary Y Z. 
Similarly <j) changing by 7r is equivalent to applying Z <g> Y. As unitaries don't change the spectra, we may restrict 
our attention to 9, <f> £ [0, 7r], (b) similarly it can be shown that applying Z <Ei I is equivalent to changing A := 9 — ir/2 
to —A, and / Z is equivalent to changing A := <f> — n/2 to —A, and hence w.l.o.g. we may restrict attention to 

0, 4> e [o, tt/2] . 
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In any magic state circuit diagram each magic state will 
be next to or close to a CSIGN gate (or equivalent), and 
so the noise that we place on the CSIGN gates can be 
shifted onto the magic-states. The magic states will typ- 
ically become Clifford states (convex mixtures of Stabi- 
lizer pure states) with much less noise than required to 
make the CSIGN quantum- or cube- separable. This de- 
pends strongly upon the noise model and precise magic 
state however - for dephasing noise models for example, 
there are qubit states (close to Z eigenstates) that require 
maximal dephasing to become Clifford states. 
For instance starting with the cube Bloch vector (1, 1, 1) 
we can cycle through (1,1,1) — > x (1,-1,-1) — > Y 
(-1,-1,1) -> x (-1,1,-1) -^ S (-1,-1,-1) ^ X 
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